Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
نویسندگان
چکیده
MOTIVATION Gene expression profiling using RNA-seq is a powerful technique for screening RNA species' landscapes and their dynamics in an unbiased way. While several advanced methods exist for differential expression analysis of RNA-seq data, proper tools to anal.yze RNA-seq time-course have not been proposed. RESULTS In this study, we use RNA-seq to measure gene expression during the early human T helper 17 (Th17) cell differentiation and T-: cell activation (Th0). To quantify Th17-: specific gene expression dynamics, we present a novel statistical methodology, DyNB, for analyzing time-course RNA-seq data. We use non-parametric Gaussian processes to model temporal correlation in gene expression and combine that with negative binomial likelihood for the count data. To account for experiment-: specific biases in gene expression dynamics, such as differences in cell differentiation efficiencies, we propose a method to rescale the dynamics between replicated measurements. We develop an MCMC sampling method to make inference of differential expression dynamics between conditions. DyNB identifies several known and novel genes involved in Th17 differentiation. Analysis of differentiation efficiencies revealed consistent patterns in gene expression dynamics between different cultures. We use qRT-PCR to validate differential expression and differentiation efficiencies for selected genes. Comparison of the results with those obtained via traditional timepoint-: wise analysis shows that time-course analysis together with time rescaling between cultures identifies differentially expressed genes which would not otherwise be detected. AVAILABILITY An implementation of the proposed computational methods will be available at http://research.ics.aalto.fi/csb/software/
منابع مشابه
I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing
Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...
متن کاملApproximate inference of gene regulatory network models from RNA-Seq time series data
Inference of gene regulatory network structures from RNA-Seq data is challenging due to the nature of the data, as measurements take the form of counts of reads mapped to a given gene. Here we present a model for RNA-Seq time series data that applies a negative binomial distribution for the observations, and uses sparse regression with a horseshoe prior to learn a dynamic Bayesian network of in...
متن کاملGenome-wide Analysis of STAT3-Mediated Transcription during Early Human Th17 Cell Differentiation.
The development of therapeutic strategies to combat immune-associated diseases requires the molecular mechanisms of human Th17 cell differentiation to be fully identified and understood. To investigate transcriptional control of Th17 cell differentiation, we used primary human CD4+ T cells in small interfering RNA (siRNA)-mediated gene silencing and chromatin immunoprecipitation followed by mas...
متن کاملI-42: Origins and Differentiation of Somatic Progenitors of The Mammalian Gonad Revealed by Single Cell RNA-Seq
Background - MaterialsAndMethods N;Results N;Conclusion N;
متن کاملA Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data
Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...
متن کامل